SO(3)‐Pose: SO(3)‐Equivariance Learning for 6D Object Pose Estimation

Authors

Abstract

6D pose estimation of rigid objects from RGB-D images is crucial for object grasping and manipulation in robotics. Although the RGB channels and the depth (D) channel are often complementary, providing respectively appearance and geometry information, it is still non-trivial to fully benefit from the two cross-modal data. We start from a simple yet new observation: when an object rotates, its semantic label is invariant to the pose, while its keypoint offset direction is variant to the pose. To this end, we present SO(3)-Pose, a new representation learning network that explores SO(3)-equivariant and SO(3)-invariant features from the depth channel for pose estimation. The SO(3)-invariant features facilitate learning more distinctive representations for segmenting objects with similar appearance from the RGB channels. The SO(3)-equivariant features communicate with the RGB features to deduce the (missed) geometry for detecting keypoints of an object with a reflective surface from the depth channel. Unlike most existing pose estimation methods, our SO(3)-Pose not only implements information communication between the RGB and depth channels, but also naturally absorbs SO(3)-equivariance geometry knowledge from depth images, leading to better appearance and geometry representation learning. Comprehensive experiments show that our method achieves state-of-the-art performance on three benchmarks. Code is available at https://github.com/phaoran9999/SO3-Pose.
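The observation in the abstract can be checked numerically on a toy example. The sketch below is not the SO(3)-Pose network. It is a minimal illustration, using made-up coordinates and hypothetical helper names (`rot_z`, `apply`, `offset`), of what "equivariant" and "invariant" mean for a point-to-keypoint offset under a rigid rotation: the offset vector rotates together with the object, while its length does not change.

```python
import math

def rot_z(theta):
    """Rotation matrix about the z-axis (one element of SO(3))."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def apply(R, p):
    """Matrix-vector product R @ p for a 3x3 matrix and a 3-vector."""
    return [sum(R[i][j] * p[j] for j in range(3)) for i in range(3)]

def offset(point, keypoint):
    """Vector pointing from a surface point to a keypoint."""
    return [k - p for p, k in zip(point, keypoint)]

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def close(a, b, tol=1e-9):
    return all(abs(x - y) <= tol for x, y in zip(a, b))

# Illustrative values only: one surface point and one keypoint on an object.
R = rot_z(math.radians(40.0))
point = [0.10, 0.02, 0.30]
keypoint = [0.05, -0.04, 0.35]

offset_before = offset(point, keypoint)
offset_after = offset(apply(R, point), apply(R, keypoint))

# Equivariance: rotating the object rotates the offset by the same R.
equivariant = close(offset_after, apply(R, offset_before))
# Invariance: the offset length (like a semantic label) ignores the rotation.
invariant = abs(norm(offset_after) - norm(offset_before)) <= 1e-9
# The offset direction itself does change with the pose.
direction_changed = not close(offset_after, offset_before)

print(equivariant, invariant, direction_changed)
```

A per-point semantic label behaves like the offset length here: it is a property of the object and stays the same under rotation, whereas the offset direction has to follow the rotation.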

Similar resources

Learning 6D Object Pose Estimation Using 3D Object Coordinates

This work addresses the problem of estimating the 6D Pose of specific objects from a single RGB-D image. We present a flexible approach that can deal with generic objects, both textured and texture-less. The key new concept is a learned, intermediate representation in form of a dense 3D object coordinate labelling paired with a dense class labelling. We are able to show that for a common datase...

On Evaluation of 6D Object Pose Estimation

A pose of a rigid object has 6 degrees of freedom and its full knowledge is required in many robotic and scene understanding applications. Evaluation of 6D object pose estimates is not straightforward. Object pose may be ambiguous due to object symmetries and occlusions, i.e. there can be multiple object poses that are indistinguishable in the given image and should be therefore treated as equi...

Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation

We present a 3D object detection method that uses regressed descriptors of locally-sampled RGB-D patches for 6D vote casting. For regression, we employ a convolutional auto-encoder that has been trained on a large collection of random local patches. During testing, scene patch descriptors are matched against a database of synthetic model view patches and cast 6D object votes which are subsequen...

The Best of Both Worlds: Learning Geometry-based 6D Object Pose Estimation

We address the task of estimating the 6D pose of known rigid objects, from RGB and RGB-D input images, in scenarios where the objects are heavily occluded. Our main contribution is a new modular processing pipeline. The first module localizes all known objects in the image via an existing instance segmentation network. The next module densely regresses the object surface positions in its local ...

6D Object Pose Estimation with Depth Images: A Seamless Approach for Robotic Interaction and Augmented Reality

To determine the 3D orientation and 3D location of objects in the surroundings of a camera mounted on a robot or mobile device, we developed two powerful algorithms in object detection and temporal tracking that are combined seamlessly for robotic perception and interaction as well as Augmented Reality (AR). A separate evaluation of, respectively, the object detection and the temporal tracker d...

Journal

Journal title: Computer Graphics Forum

Year: 2022

ISSN: 1467-8659, 0167-7055

DOI: https://doi.org/10.1111/cgf.14684